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Abstract 

We consider a random graph Q(n,p) whose vertex set V has been randomly 
embedded in the unit square and whose edges are given weight equal to the geometric 
l/-) • distance between their end vertices. Then each pair {u, v} of vertices have a distance 



(N 

in 



in the weighted graph, and a Euclidean distance. The stretch factor of the embedded 
graph is defined as the maximum ratio of these two distances, over all {u, v} C V . 
We give upper and lower bounds on the stretch factor (holding asymptotically almost 
surely), and show that for p not too close to or 1, these bounds are best possible in 
a certain sense. Our results imply that the stretch factor is bounded with probability 
tending to 1 if and only if n(l — p) tends to 0, answering a question of O'Rourke. 

1 Introduction 

Let G be a graph embedded in the plane. For every two points u and v, let d(u, v) denote 
their Euclidean distance. Make G weighted by putting weight d(u,v) on every edge uv. 
For two vertices u and v, let da(u,v) denote their shortest-path distance on (weighted) 
G. The stretch factor of G is defined as 

d G (u,v) 

max — , 

d{u, v) 
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where the maximum is taken over all vertices u, v. If G is disconnected then its stretch 
factor is undefined. 

The stretch factor (also known as the spanning ratio or the dilation) is a well studied 
parameter in discrete geometry, see for instance the book [6] or the recent survey [2]. An 
important problem in this context is the following. Given n points on the plane, find a 
set of 0{n) pairs of them, such that when you create a geometric graph by adding the 
segments joining the points in each pair, this geometric graph has bounded stretch factor. 
A possible approach is to choose a random set of pairs. Suppose that we randomly choose 
M distinct pairs from the set of all (™) pairs of points, and add the corresponding edges. 
Then, one can ask, how large should M be to guarantee that the stretch factor is bounded, 
with probability tending to 1? In this paper we show that if the initial points are chosen 
uniformly at random from the unit square, then we need almost all edges to guarantee a 
bounded stretch factor, hence this method is inefficient. 

The setting is as follows. Select n points uniformly at random from the unit square, and 
then form a random geometric graph G on these points by joining each pair independently 
with probability p, where p is in general a function of n. This is not a "random geometric 
graph" in the sense of Penrose [7], because points are joined without regard to their 
geometric distance. However, one can call this a randomly embedded random graph, since 
you get the same thing if you start from an Erdos-Renyi random graph with parameters 
n,p and embed each of its vertices into a random point in the unit square. The stretch 
factor of G is a random variable and we denote it by F{n,p). We study the asymptotic 
behaviour of .F(n,p) when n is large, and give probabilistic lower and upper bounds for 
it. In the following, asymptotically almost surely means with probability 1 — o(l), where 
the asymptotics is with respect to n. 

In the open problem session of CCCG 2009 [3], O'Rourke asked the following question: 
for what range of p is J-"(n, p) bounded asymptotically almost surely? As a conclusion of 
our bounds, we answer this question as follows. Let A > 1 be any fixed constant, and 
note that w(l) denotes a function that tends to infinity as n grows. If n(l — p) — ou(l), 
then asymptotically almost surely JF(n,p) > A. If n(l — p) — 6(1), then T{n,p) > A 
with probability Finally, if n(l — p) — o(l), then asymptotically almost surely 

JF(n,p) < A. 

Our main lower bound is the following theorem. 
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Theorem 1. Let w(n) = ou(l). Then asymptotically almost surely 

\/n{l-p) 



win) 



Let A be fixed. This theorem implies that if n(l — p) = then asymptotically 

almost surely > A. This strengthens the result of the first author [5], who proved 

the same thing for p < 1 — 

Let CON denote the event "G is connected." Recall that if the graph is disconnected, 
then its stretch factor is undefined. For any p < 1, this happens with a positive probability, 
hence ElJ^n,^)] is undefined. It is then natural to bound 'E[J r (n,p)\CON] instead. 

Our main upper bound is the following theorem. 

Theorem 2. Let p 2 n > 331ogn and let w{n) = uj(1). Then, asymptotically almost surely 
we have 

, win) a/ 'nil — p) 

F(n,p) < 1 + V ;V V — . 

p 

If p 2 n > 113 log n, then 



E [T(n,p)\ CON] < 1 + V™8n(l-p) + 

p 

When n(l — p) — o(l), this theorem implies that for any fixed e > 0, we have that 
E [J r (n : p)\CON] < 1 + e, and asymptotically almost surely ^(n^p) < 1 + e. 

The more interesting case is when n(l — p) — 0(1). In this regime Theorem 2 states 
that E [J r (n,p) \ CON] = 0(1). So, one may wonder if there is a constant A such that 
asymptotically almost surely ^(n, p) < A. However, Lemma 8 (which is the main lemma in 
the proof of Theorem 1) implies that this is not the case: for any fixed A, with probability 
we have CON and J-"(n,p) > A. In other words, the random variable J-"(n,p) is not 
concentrated. In this case one might expect that the distribution of F(n, p) tends to some 
nontrivial limit if n(l — p) is constant. 

Lemma 8 actually implies that for a wide range of p, the first conclusion of Theorem 2 
is tight, in the sense that w{n) cannot be replaced with a constant. Namely, the following 
is true. 

Theorem 3. Assume that p = f2(l) and n(l — p) — There is no absolute constant 

C for which asymptotically almost surely 



Cy/n(l-p) 

•F{n,p) < — h 1 . 

p 
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There is a nontrivial gap between our lower and upper bounds when p — o(l). It 
remains open to determine which of the bounds are closer to the correct answer in this 
regime. 

The following notation will be used in the rest of the paper. For a point Q and 
nonnegative real R, C(Q, R) denotes the intersection of the disc with centre Q and radius 
R and the unit square, and F simply denotes J-"(n,p). We often identify each vertex with 
the point it has been embedded into. All logarithms are in natural base. 

2 The Lower Bound 

In this section we prove Theorems 1 and 3. First we need an easy geometric result. 

Proposition 4. Let Q be a point in the unit square. If < R < 1/2, then C(Q,R) has 
area at least txR 2 /A. IfO < R < \[2, then C(Q,R) has area at least irR 2 /32. 

Proof. By symmetry, we may assume that Q lies in the upper left quarter of the unit 
square. If < R < 1/2, then the bottom right quarter of the disc with centre Q and 
radius R lies completely inside the unit square, and hence the intersection area is at least 
ttR 2 /A > vri? 2 /32. If 1/2 < R < x/2, then C{Q,R) contains C(Q, 1/2), so its area is at 

].6clStj 

tt(1/2) 2 _ _7T_ _ tv(V2) 2 > ttR 2 
4 ~~ 16 ~~ 32 ~ ~32~ ' 

Let c be such that 1/51 < c < l/16n and cn is an even integer. Notice that here, as 
in the rest of the paper, we always mean l/(ab) when we write 1/ab. 

Lemma 5. Choose cn points independently and uniformly at random from the unit square. 
Build a graph H on these vertices, by joining two vertices if their distance is at most 2/ 1 y/n. 
With probability at least 1 — 0(l/n), H has at least cn/2 isolated vertices. 

Proof. Let X be the number of edges of H. Then we have X = J2i<j where X iy j is 
the indicator variable for the distance between vertices i and j being at most 2/ yfn. Note 
that if vertex % has been embedded in point p iy then X i; j = 1 if and only if vertex j is 
embedded in C(p i) 2/y/n). So by Proposition 4 we have 

n/n < E [Xi d ] <47i/n. 
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Let g = E [X^], q 2 = 4vr/n, and M = ( c ™). Thus 

E [X] = qM = 0(n) . 



We claim that Var [X] = 0{n). Note that if {i,j} and {k,l} are two disjoint sets of 
vertices, then 

E [XijX k j] = E [Xij] E [X k; i] = q 2 , 

and the number of such pairs of pairs equals ( c ™) ( c ™ 2 ~ 2 ) < M 2 . Otherwise, let j = I. Then 
for Xij = Xkj = 1 to happen, both vertices i and k should be embedded at distance at 
most 2/ y/n from where vertex j has been embedded, the probability of which is not more 
than q\. Hence in this case 

E [XijX kt i] < q\ , 

and the number of such pairs of pairs is not more than (cn) 3 . Consequently, 

E [X(X - 1)] = Yl E i X iM < M \ 2 + (c") 3 9l > 
{W{k,i} 

and so 

Var [X] = E [X(X - 1)] + E [X] - E [X] 2 < M 2 q 2 + (cnfq 2 2 + qM - q 2 M 2 = 0(n) . 



By Chebyshev's inequality, 

Pr[X>2E[X]]<^^l = 0(l/n). 

Thus with probability 1 — 0(l/n), H has at most 2E [X] < Ac 2 nn edges. If this is the 
case, then it has at least 

cn — 8c 2 nn > cn/2 

isolated vertices, and this completes the proof. ■ 



Now, back to the main problem. Assume that the n vertices are embedded one by 
one, and the edges are exposed at the end. Consider the moment when exactly cn vertices 
have been embedded. Build an auxiliary graph on these vertices, by joining two vertices if 
their distance is at most 2/ y/n. By Lemma 5, with probability 1 — 0(l/n), this graph has 
at least cn/2 isolated vertices. We condition on the embedding of the first cn vertices such 
that this event holds. Let A be a set of cn/2 isolated vertices in this graph. The vertices 
in A are called the primary vertices, the vertices that are one of the first cn vertices but 
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are not in A are called far vertices, and the vertices that have not been embedded yet are 
called the secondary vertices. 

Let m = cn/2 be the number of primary vertices, and n' = n(l — c) be the number 
of secondary vertices. A primary disc is a set of the form C(v, 1/y/n), where v G A; 
the vertex v is called the centre of the primary disc. So, we have m primary discs, say 
1Zi,1Z2, • • • , 7Z m . Let W be the set of points of the unit square that are not contained in 
any primary disc. Notice that by the definition of A, the primary discs are disjoint, and 
no far vertex is contained in any primary disc. 

Consider the following process for embedding the secondary vertices and exposing all 
of the edges of G: 

1. Note that {W, 72. 2 , • • • , Tim} is a partition of the unit square. In the first phase, 
to each secondary vertex, independently, we randomly assign an element of the 
partition, with probability proportional to the area of that element. Thus for each 
primary disc it is known how many secondary vertices it contains, but their exact 
position is not known. 

2. In the second phase, for each secondary vertex we choose a random point in the 
corresponding element, and place the vertex at that point. 

3. In the third phase, for every pair of vertices we add an edge independently with 
probability p. 

Clearly this process generates random geometric graphs with the same distribution as 
before, however it makes the analysis easier. 

Lemma 6. With probability 1 — exp (— fl(n)), after the first phase, there exist at least 
e~ 8 m primary discs containing exactly two vertices: one primary (the centre) and one 
secondary. 

To prove this lemma we will use the following large deviation inequality, which is 
Corollary 2.27 in Janson et al. [4]. 

Proposition 7 ([4]). Let Z\, Z 2 , . . . , Z n be a sequence of independent random variables, 
and suppose that the function f satisfies 

\f(x 1 ,x 2 ,...,x n ) - f(yi,y 2 ,. ■ ■ ,y n )\ < c, 
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whenever the vectors (x±,X2, ■ ■ ■ , x n ) and (yi, y2, ■ ■ ■ , y n ) differ only in one of the coordi- 
nates. Then, 

Pr[/(Z!, Z 2 , . . . , Z n ) - E[/(Zx, Z 2 , . . . , Z n )] < -i] < exp 

Proof of Lemma 6. Recall that we have n' secondary vertices, and to each of them, in- 
dependently, we randomly assign an element of the partition {W, 72-2, • • • , TZm}, with 
probability proportional to the area of that element. By Proposition 4, the area of each 
primary disc is between ir/4n and n/n; so for every 1 < % < m, the probability that TZi 
contains exactly one secondary vertex is at least 

i j - (l - -) > — (exp(-27r/n)) B > ^ exp (-2n'n/n) > - exp(-2vr) , 

as n' > n/2. Let pi = 7rexp(— 2n)/8. 

Let X be the number of primary discs that contain exactly one secondary vertex. 
Since every primary disc contains exactly one primary vertex (its centre), to prove the 
lemma we need to show that we have X > e~ 8 m with probability 1 — exp (— Q(n)). The 
calculation above shows that E [X] > mpi. Since p 1 > 2e~ 8 , showing that 

Pr [X < E [X] /2] < exp (-ft(ra)) 

completes the proof of the lemma. 

To every secondary vertex v we assign a variable Z v , which equals k if v is assigned to 
IZk in the first phase, and equals if v is assigned to W. Since the assignment in the first 
phase is done independently, the random variables {Z v : v secondary} are independent. 
Moreover, changing the value of Z v for a single vertex will change X by at most 2. Hence 
by Proposition 7 we have 

Pr [X < E [X] /2] < exp ( ^^ J < exp {^f) = exp (-Q(n)) . 

Therefore, with probability 1 — exp(— fl(n)), there exist at least e _8 m primary discs 
containing exactly two vertices. ■ 

A primary disc 1Z containing exactly two vertices u and v is called nice if the distance 
between u and v is less than l/X^/n, and u and v are not adjacent in G. We claim that 
the existence of a nice primary disc TZ implies that the stretch factor of G is larger than A. 
To see this, assume, by symmetry, that u is the vertex of the centre of 1Z. The (weighted) 
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distance between u and v in G is at least l/y/n, since any (u, i>)-path in G must go out 
of TZ at the very first step. However, the Euclidean distance between u and v is at most 
1/Xy/n, and we have 

d G (u,v) 



d(u, v) 



> A. 



Theorem 1 follows immediately from the following lemma. 
Lemma 8. For any positive A we have 

cn(l — p) 



Pr [F(n, p) < A] < exp 



2e 8 A 2 



+ o(l). 



Proof. Consider a primary disc 72. such that after the first phase, it has been determined 
that TZ contains exactly two vertices, u and v, where u is the centre of TZ. Then for TZ to 
be nice, in the second phase v should be placed in C(u, 1/Xy/n), and in the third phase u 
and v should become nonadjacent in G. The probability of the former is at least 

even if the disc of radius 1 / y/n centred at u is not wholly contained in the unit square, and 
the probability of the latter is 1 — p. These two events are independent, so the probability 
that TZ is not nice is at most 1 — (1 — p)X~ 2 . 

By Lemma 6, once the first phase finishes, with probability 1 — exp(— Q(n)) there 
exists a set B of at least e~ 8 m primary discs, such that each primary disc in B contains 
exactly two vertices. We condition on this event in the following. The crucial observation 
is that the events happening during the second and third phases inside each primary disc 
in B are independent of the others. In particular, the events 

{TZ is nice : TZ G B} 

are mutually independent; hence the probability that none of the primary discs in B 
become nice during the second and third phases, is at most 

cn(l — p) 



[1-(1-P)X~Y m <exp 



2e 8 A 2 



so that 



Pr < A] < exp 



cn(l — p) 
2e 8 A 2 



+ 0(l/n) +exp(-ft(n)) . 



Theorem 3 follows from Lemma 8 by putting A = C' \/n(\ — p) for a suitable constant 
C", noting that p = £1(1) and n(l — p) — £1(1). 
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3 The Upper Bound 



In this section we prove Theorem 2. We will use the following version of the Chernoff 
bound. This is Theorem 2.1 in Janson et al. [4]. 

Proposition 9 ([4]). Let X — Xi + • ■ • + X, m , where the Xi are independent identically 
distributed indicator random variables. Then for any e > 0, 



Proof. Say a pair (u,v) of vertices is bad if dc(u,v) > (2A + l)d(u,v). Let u and v 
be arbitrary vertices. First, we show that with probability at least 1 — exp(— p 2 n/16), 
u and v have at least p 2 n/4 common neighbours. The expected number of common 
neighbours of u and v is p 2 {n — 2) > p 2 n/2, and since the edges appear independently, 
by Proposition 9, the probability that u and v have less than p 2 n/4 common neighbours 
is less than exp(— p 2 n/16). In the following, we condition on the event that u and v have 
at least p 2 n/A common neighbours. 

Now, consider the random embedding of the graph. For any t > 0, if u and v are 
adjacent, or if d(u,v) > t and u and v have a common neighbour w with d{u,w) < \t, 
then we would have dc(u,v) < (2A + \)d{u,v) so the pair is not bad. To give an upper 
bound for the probability of badness of the pair, we compute the probability that u and 
v are nonadjacent, and ih < d(u,v) < (i + l)h, and they have no common neighbour w 
with d(u,w) < \ih, and sum over i. 

Let us condition on the embedding of vertex u, and denote by a(u, s) the area of the 
set of points in the unit square at distance at most s from u, and let q — 1 — p. Then 
since u and v have at least p 2 n/4 common neighbours and these common neighbours are 
embedded independently, for any h > 



Note that lim^ <^ih) ^ 27rih and algQ for ^ > y^/A the summand is zero. 

Hence, letting t = ih and taking the limit, 



Pr [X < (1 - e)E [X]] < exp(-e 2 E [X] /2) . 



Lemma 10. For any positive X, 




[2/h\ 

Pr [(u, v) bad] < V K M > (* + l ) h ) ~ °( M > ih )\ (! ~ °( M > A^)) p2 " /4 . 



Pr[(u,v) bad] < 
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By Proposition 4, a(u, At) > 7r(At) 2 /32. Hence 



128g 



Pr [(„, ,) bad] < J 2n t t(l - , { Xtf^ Y At = - ^ . ^ 



V2/X 



t=0 



128g 



1-1 



. i i V_n 



(p 2 n + 4)A 2 
So by the union bound 

Pr [F > 2A + 1] = Pr [3 a bad pair] < n 2 
We are now ready to prove Theorem 2. 



< 



128g 
p 2 n\ 2 



cxp 



p 2 n\ t 128(1 -p) 
p 2 n\ 2 



16 



+ 



Proof of Theorem 2. We need to show that if p 2 n > 33 log n and tu(ra) 
asymptotically almost surely we have 



then 



p < w{n)y/n{l-p) 1 
P 



and that if p 2 n > 113 log n, then 



ElF] CON]<l + ^^EA 



For the first part, let A 



w(n)yj n(l—p) 



2p 



p 



By Lemma 10, 



o(l) 



Pr 



F > 



w{n)^n{\ -p) i 



p 



= Pr [F > 2A + 1] < n 2 exp -— 4 



p 2 n\ 128ra(l — p) 



16 



p 2 \ 2 



= o(l). 



Thus, asymptotically almost surely F < w( - n W n ( 1 _g) _|_ j__ 



For the second part, let e = y 12 ^ — ■ Since F is nonnegative, we have 

POD 

[F\ CON] = / Pr[F > s\ CON] ds 

Js=0 

pl+e rn 5 poo 

= / Pr[F > s\ CON] ds + Pr[F > s\ CON] ds + Pr[F > s\ CON] ds. 

Js=0 Js=l+e Js=n 5 
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Clearly, 

-l+£ 

Pr[F > s\ CON] ds < 1 + e . 

's=0 



Assuming the graph is connected, for any pair {u, v} of vertices there is a path having 
at most n — 1 edges joining them, so da(u,v) < ny/2. Hence, if F > s, then there exists 
a pair {u,v} of nonadjacent vertices with d(u,v) < n\[2js. Let u be a fixed vertex. Let 
A u be the event that there exists a vertex, not adjacent to u, at distance less than n\/2/s 
from u. By the union bound, PrL4J < 27m 3 (l— p)/s 2 . The probability that there exists a 
vertex u for which A u happens is by the union bound less than 27m 4 (l — p)/s 2 . Therefore, 

f°° Pr[F > s\ CON] ds < ^(l-P) ds = o( i). 

J s=n 5 J s=n 5 * 

We will now bound the second term in the right hand side of (1). Notice that for any 
fixed embedding of the vertices, the event U F > A" is a decreasing property (with respect 
to the edges in the graph), and the event U G is connected" is an increasing one. Hence 
by the correlation inequalities (see, e.g., Theorem 6.3.3 in Alon and Spencer [1]) we have 

Pr [F > A| CON] < Pr [F > A] . 
Let A = (s - l)/2. Then by Lemma 10 
Pr [F > s] < n 2 exp 



p 2 n\ 128n(l — p) 



16 J ' p 2 X 2 



Therefore, since p 2 n > 113 log n, 

-.5 fn 5 /2 / 2 



P Pr [F > s] ds < P 2n 2 exp ( + 

Js=l+e J\=e/2 V 16 / 

< n 7 exp ( 



p n\ 256n(l — p) 
' dX 



p 2 \ 2 



pn\ — 256n(l — p) 



16 ) p 2 \ 

, x 512n(l -p) 
< o{\) + \ ^ = oil) + e . 



n 5 /2 
A=e/2 



Consequently, 

-l+£ 



E[F] COA^] = / Pr[F > s\ CON] ds + Pr[F > s\ CON] ds + Pr[F > s\ CON] ds 

Js=0 Js=l+e Js=n 5 

v/2048n(l -p) 

< l + 2e + o(l) = 1 + ^ i — + o(l). ■ 
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